To test or not to test: Preliminary assessment of normality when comparing two independent samples
Abstract
BACKGROUND: Student's two-sample t test is generally used for comparing the means of two independent samples, for example, two treatment arms. Under the null hypothesis, the t test assumes that the two samples arise from the same normally distributed population with unknown variance. Adequate control of the Type I error requires that the normality assumption holds, which is often examined by means of a preliminary Shapiro-Wilk test. The following two-stage procedure is widely accepted: If the preliminary test for normality is not significant, the t test is used; if the preliminary test rejects the null hypothesis of normality, a nonparametric test is applied in the main analysis.

METHODS: Equally sized samples were drawn from exponential, uniform, and normal distributions. The two-sample t test was conducted if either both samples (Strategy I) or the collapsed set of residuals from both samples (Strategy II) had passed the preliminary Shapiro-Wilk test for normality; otherwise, Mann-Whitney's U test was conducted. By simulation, we separately estimated the conditional Type I error probabilities for the parametric and nonparametric part of the two-stage procedure. Finally, we assessed the overall Type I error rate and the power of the two-stage procedure as a whole.

RESULTS: Preliminary testing for normality seriously altered the conditional Type I error rates of the subsequent main analysis for both parametric and nonparametric tests. We discuss possible explanations for the observed results, the most important one being the selection mechanism due to the preliminary test. Interestingly, the overall Type I error rate and power of the entire two-stage procedure remained within acceptable limits.

CONCLUSION: The two-stage procedure might be considered incorrect from a formal perspective; nevertheless, in the investigated examples, this procedure seemed to satisfactorily maintain the nominal significance level and had acceptable power properties.
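The two-stage procedure described in the abstract can be illustrated with a minimal simulation sketch. This is not the authors' code: the sample size, number of replications, the 0.05 levels, and the function names `two_stage_test` and `simulate` are assumptions chosen for illustration, and the abstract does not state the settings actually used. The sketch draws both samples from the same distribution (so the null hypothesis holds), applies Strategy I or Strategy II, and reports the conditional and overall Type I error rates.

```python
import numpy as np
from scipy import stats


def two_stage_test(x, y, strategy="I", alpha_pre=0.05):
    """Return (branch, p_value) for the two-stage procedure.

    Strategy I: both samples must pass the Shapiro-Wilk test.
    Strategy II: the pooled residuals (each sample centred at its own
    mean) must pass the Shapiro-Wilk test.
    alpha_pre is an assumed level for the preliminary test.
    """
    if strategy == "I":
        passed = (stats.shapiro(x).pvalue > alpha_pre
                  and stats.shapiro(y).pvalue > alpha_pre)
    else:
        residuals = np.concatenate([x - x.mean(), y - y.mean()])
        passed = stats.shapiro(residuals).pvalue > alpha_pre

    if passed:
        # Preliminary test not significant: Student's t test.
        return "t", stats.ttest_ind(x, y, equal_var=True).pvalue
    # Normality rejected: Mann-Whitney U test.
    return "u", stats.mannwhitneyu(x, y, alternative="two-sided").pvalue


def simulate(sampler, n=20, reps=2000, alpha=0.05, strategy="I", seed=0):
    """Estimate conditional and overall Type I error under the null."""
    rng = np.random.default_rng(seed)
    counts = {"t": 0, "u": 0}
    rejects = {"t": 0, "u": 0}
    for _ in range(reps):
        x, y = sampler(rng, n), sampler(rng, n)  # same population
        branch, p = two_stage_test(x, y, strategy=strategy)
        counts[branch] += 1
        rejects[branch] += int(p < alpha)
    conditional = {
        b: (rejects[b] / counts[b] if counts[b] else float("nan"))
        for b in counts
    }
    overall = (rejects["t"] + rejects["u"]) / reps
    return {"counts": counts, "conditional": conditional, "overall": overall}


if __name__ == "__main__":
    # Exponential parent distribution, one of the three cases studied.
    result = simulate(lambda rng, n: rng.exponential(size=n))
    print(result)
```

Comparing `conditional["t"]` and `conditional["u"]` with `overall` for exponential, uniform, and normal samplers gives a rough way to see the selection effect the abstract describes; the numbers depend on the assumed settings.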
Related articles
Preliminary Tests of Normality When Comparing Three Independent Samples
This paper uses simulation to explore the performance of a two-stage procedure where a preliminary Shapiro-Wilk test is used to choose between the ANOVA and Kruskal-Wallis tests as a three-sample location test. The results suggest that the two-stage procedure actually seems to be preferable when conducting such location tests.
The Two-Sample t-Test and Randomization Test
Many Six Sigma practitioners use “Student’s” independent two-sample t-test when investigating differences in means. This type of test is based upon drawing random samples from two independent normal (Gaussian) distributions. Often, however, the assumption of normality and/or random sampling is violated. This article discusses when and why the two-sample t-test may be considered robust to these ...
Nonparametric Methods for Two Samples
• In the independent two-sample t-test, we assume normality, independence, and equal variances.
• This t-test is robust against nonnormality, but is sensitive to dependence.
• If n1 is close to n2, then the test is moderately robust against unequal variance (σ1 ≠ σ2). But if n1 and n2 are quite different (e.g. differ by a ratio of 3 or more), then the test is much less robust.
• How to deter...
The Effect of Item Modality and Note-taking on EFL Learners’ Performance on a Listening Test
The pivotal role of listening comprehension in second/foreign language learning requires that researchers conduct studies which investigate factors that affect test takers’ performances. The present study set out to examine whether item modality (i.e., written vs. oral items) affects listening comprehension test performance. In addition, it investigated whether allowing test takers to take ...
Determining the sample size required to compare vegetation and soil characteristics in two independent groups using effect size
Extended Abstract. Background and objectives: One of the important steps in assessing rangeland vegetation is determining the sample size. The adequacy of the sample size and how to determine it are always among the main concerns of rangeland vegetation analysts. There are two general methods for determining the sample size in rangeland science: graphic and statistical methods. In this study, the sample...